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Abstract — The optimization of the transmit parameters (power 
allocation and steering vectors) for the MIMO BC under general 
linear constraints is treated under the optimal DPC coding strat- 
egy and the simple suboptimal linear zero-forcing beamforming 
strategy. In the case of DPC, we show that "SINR duality" 
and "min-max duality" yield the same dual MAC problem, and 
compare two alternatives for its efficient solution. In the case of 
zero-forcing beamforming, we provide a new efficient algorithm 
based on the direct optimization of a generalized inverse matrix. 
In both cases, the algorithms presented here address the problems 
in the most general form and can be applied to special cases 
previously considered, such as per-antenna and per-group of 
antennas power constraints, "forbidden interference direction" 
constraints, or any combination thereof. 

I. Model and background 

One channel use of the MIMO BC with an Af-antenna 
transmitter and K single-antenna receivers is defined by 



yfc = hfcX + Zfc, k=l,. 



,K 



(1) 



where , x e C are the channel vector of user k and the 
transmitted signal vector, respectively, and Zk ^ CAf{0, 1) is 
AWGN. The relevance of the above model for the downlink of 
a wireless system has been widely discussed. Also, the impact 
of non-ideal channel state information and practical techniques 
for channel estimation and channel state feedback are well- 
understood (see for example [1], [2] and references therein). 
Here, we assume fixed channel vectors perfectly known to 
all terminals and focus on the optimization of the transmitter 
parameters. 

Let S denote a compact set of M x AI covariance matrices. 
The capacity region of the MIMO BC ([TJ subject to the input 
constraint E[xx'^] = Ylx G 5 is given by the set of rate points 
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C(5;hi,...,hK) = coh y U 
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and it is achieved by Dirty-Paper Coding (DPC) where the 
permutation vr of the index set {1, ... , A'} denotes the succes- 
sive encoding order and where the transmit covariance is given 
by Tix = X^sLi ^'=^fe defined by the unit-norm "steering 
vectors" {vk} and by the users transmit powers {qk}- 



The transmitter parameters {vfc}, {qk}, tt, achieving points 
on the boundary of (3(5; hi, . . . , hx), can be determined 
by solving the Weighted Rate Sum Maximization (WSRM) 
problem 
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maximize 



y^^wkRk 
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subject to R e C(5; hi, . . . , h/j-) 
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for some suitable nonnegative weights {wk}- Although a direct 
solution of (|3]l is difficult, for the special case where the 
constraint set S is defined by linear inequalities 



tr(S^*f) < 7^, 



(4) 



where {^i} are positive semidefinite symmetric matrices and 
{je} are non-negative coefficients, the solution of ^ can 
be computed efficiently by solving a sequence of convex 
problems. Explicit algorithms for this computation will be 
presented in Section [II] 

By the Heine-Borel theorem, the compactness of the set S 
implies that S is bounded with respect to the Frobenius norm. 
Hence, without loss of generality, we can always include an 
additional trace constraint tT{Hx) < P for some sufficiently 
large P, without modifying the problem. It should also be 
noticed that Q includes some particularly important special 
cases studied in the literature: for i = 1, 71 = P and 
4>i = I we have the classical sum-power constraint [4]-[7]; 
for L — AI and being all zero but one "1" in the {£,£)-th 
position, we have the per-antenna constraint [8]; for L < M 
and having all zeros but a segment of consecutive "1" on 
the diagonal we have the per-group of antennas constraint [8]; 
for some arbitrary L and rank- 1 = cgc^ we have a general 
"interference" constraint where the unit-vector ci denotes a 
"forbidden" direction along which the transmit power must be 
not larger than 7^ [9]. 

Linear beamforming is a simple precoding strategy that can 
be an attractive alternative to DPC. In this case, the achievable 
rate region has the same form of (|2]l but the encoding order tt 
is irrelevant and the sum in the denominator of the term inside 
the log includes all i ^ k. The optimization of the transmit 
powers {qk}, however, is actually more difficult than with DPC 
since the WSRM problem with linear beamforming has no 
general convex programming equivalent. We shall focus on 
linear Zero-Forcing Beamforming (ZFBF) since in the regime 
of high SNR it is close to optimal and, as we will see, it lends 



itself to an efficient solution. In this case, the WSRM problem 
subject to general linear constraints is given by 



maximize 



subject to 
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We assume K < M and H = [hi,...,]!^] e 
of rank K, otherwise the problem is infeasible. In practical 
applications, the number of users may be larger than the 
number of antennas and some greedy user selection algorithm 
takes care of selecting an "active subset" of size not larger than 
M, but we do not consider this aspect here. Again, a direct 
solution of ^ is difficult. The problem has been addressed 
using convex relaxation and the theory of generalized inverses 
in [10], for the case of per-antenna power constraint and equal 
weights (maximization of the sum-rate). In Section |lll] we 
present a new efficient algorithm that addresses (j5]l in full 
generality. 

II. WSRM ALGORITHMS FOR DPC 

Without loss of generality, assume wi > ■ ■ ■ > wk > 0. 
Then, it is well-known that the optimal DPC encoding order 
is TT = {1, . . . , K}. In [9], using a technique called "SINR 
duality", the following fundamental results are proved. Define 
the "dual MAC" corresponding to ([T]| as the multiple-access 
Gaussian channel 
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y = X! 
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where y,z e C" , z - CJ^{0,'E^{X)) with S;,(A) = 
Efci ^i^e for some vector of non-negative coefficients A > 
and each transmitter has power constraint E[|a;fcp] < pk, 
subject to a total sum-power constraint 

K L 

^Pfc<^A,7£ (7) 

fc=i e=i 

Then, for any A > 0, the value of the original MIMO BC 
WSRM problem is upperbounded by the value of the new 
MAC WSRM problem 

K 

maximize WkRk 
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subject to R e C: 



MAC 



(8) 



where Cmac denotes the capacity region of the dual MAC 
defined above for given parameters A, {^e} and {7^}. The 
solution of (jsjl is achieved by successive decoding in the order 
K,K — 1, . . . , 1, i.e. the reverse of the DPC encoding order. 
Furthermore, the upperbound provided by the dual MAC is 
tight: denoting by g{\) the value of the dual-MAC problem 
for given A, the value of the MIMO BC problem can be 
obtained by minimizing .g(A) with respect to A > 0. Hence, 
the MIMO BC WSRM problem can be solved by iterating 



between one "outer problem" solving the minimization of 
g{X) and an "inner problem" solving (jsjl for fixed A. An 
efficient solution of the inner problem is obtained, with minor 
modifications, using the Lagrange duality approach of [11], as 
done for example in [12]. 

The outer problem can be solved by a subgradient iteration. 
Let A(n) denote the current value of A at step n. Then, the 
next value is given by A(?t. + 1) = X{n) — e„ s(A(n)), 
where s(A(ri)) is a subgradient of g{\) at A = \{n) and 
£ri = £0^^ is the adaptation step for some suitable eoi ^ > 0. 
A subgradient for the problem is given by the vector with 
components [9] s^(A) = 7^ — tr (^^.(A)*^), where Sa;(A) 
denotes the transmit covariance matrix of the MIMO BC 
corresponding to the dual MAC at given A. Intuitively, if the 
£-th constraint is violated, i.e., if 7^ — ti {Tix{X)^i) < 0, the 
corresponding variable must be increased, otherwise, Af 
is decreased. The calculation of the subgradient requires the 
mapping of the solution of the dual MAC (for given A) into 
the corresponding solution (powers and steering vectors) of 
the MIMO BC in order to determine Hx{X.). This is obtained 
by well-known "MAC-to-BC" transformations [6]. 

In [8], the per-antenna power constraint is considered and 
a "min-max duality" approach is used in order to obtain a 
saddle-point convex-concave optimization problem that can be 
solved by an iterative infeasible-start Newton method [13]. 
Following a similar approach, after some algebra omitted here 
for lack of space, we find a min-max dual MAC problem for 
the case of general linear constraints in the form: 



, |Sl(A)+E- = ih,h>,| 

subject to S^(A) = I + Y.'t^i ^t^i, 



(9) 



where P denotes the total sum-power constraint of the MIMO 
BC. As we already argued, a sum-power constraint corre- 
sponding to 4>o = I and 70 = P can always be included 
without loss of generality. It is not difficult to show that the 
optimal value of the corresponding dual variable is Aq = L It 
follows that ([HJ and (|9| are indeed identical. 

The infeasible start Newton method can be used as an 
alternative to the (inner) Lagrange duality - (outer) subgradient 
method reviewed before. Since this algorithm is only briefly 
presented in [8] for the case of per-antenna power constraint, 
and several computation steps are left to the reader, we give 
more details here for the general linear constraint case. First, 
we define the modified objective function for (|9]l 
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where < > is a parameter that controls a "logarithmic barrier" 
term in order to prevent the iterative algorithm to approach the 
boundaries where some elements in p or in A may become 
zero or negative and where we define = Wk — Wk+i with 
wk+i = 0. The logarithmic barrier guarantees that the optimal 
value of the problem can be approached with gap Along 
the iterations, the value t shall be increased in order to make 
this gap as small as desired. 

The problem is convex with respect to A and concave 
with respect to p, with Lagrangian function (neglecting the 
non-negativity constraints and using the modified objective 
function ([T0]|) given by 

C{p, A, ^^) = /t(p, A) - ^ (l^p -P- (11) 

with 7 = (71, . . and p = (pi, . . . ,pkV. The neces- 

sary and sufficient conditions for optimality are given by the 
KKT conditions: 

5/t(p,A) 
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rs = P + 7"^A - l"rp = (12) 

The vector r ~ {rj, rj, ra)^ of dimension K + L + 1 forms 
the "residual" of the KKT equations. The algorithm finds a 
direction and a step for updating the variables (p, A,/i) > 
such that, as the number of iterations grows, the norm of the 
residual tends to zero. The updating direction is given by d = 
— (Vr)~^ r, where the so-called KKT matrix is given by 



Vr = 



r dn 
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Letting for simplicity the vector of variables be denoted by 
x= (p^,A^,/i)^, the algorithm takes the following form: 

1) Fix the algorithm parameters v > 1, and (5 > 0. Initialize 
x(0) to some positive values, let n — 0, and t = \. 

2) Compute the updating direction d(n) at x{n). (see 
explicit expressions of the derivatives given later on). 

3) Update x(?t. + 1) = x(n) + sd{n) where s is found by 
backtracking line search: initialize s = 1 and find s such 
as, while 

||r(x(n) + sd(n))|| > (1 - as) ||r(x(n))|| 

then s ^ (3s, where jS G (0, 1) and a G (0, 1/2) are 
fixed constants, (typical values are a = 0.3 and (3 = 
0.8). 

4) If ||r(x(n+ 1))|| < S, move to the next step, otherwise 
set n ^ ?i + 1 and go back to step 2. 

5) If ^^-j— < S, exit and accept the value of x(n + 1) as 
the final value, otherwise set t ^ vt and n ^ n + 1 and 
go back to step 2. 

Explicit expressions for the elements of the KKT ma- 
trix Vr can be obtained using matrix calculus (see for 
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Fig. 1 . Rate and power convergence behavior of the subgradient method for 
DPC with M = 4 and K = 3 under the sum transmit power and interference 
constraints with L = 2 forbidden directions. 
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Figs. [T] and [2] show a numerical example for M = 4 anten- 
nas, K — 3 users, and unit weights for users = 1, with 
L = 2 forbidden interference directions. We considered a sum- 
power constraint with P = 10 and interference constraints 
with 71 = 72 = 2.5. The same channel vectors and unit- 
norm interference direction vectors cg were used in both plots. 
Fig. [T] and Fig. [2] show the evolution of the objective function 
(sum rate) and of the sum-power and interference constraints 
versus the number of iterations n for the subgradient-based 
algorithm and infeasible start Newton algorithm, respectively. 
Circles in Fig. [T| mark the iterations at which the subgradient 
update is performed, i.e. the start of the new inner problem 
loop. Of course, both algorithms converge to the same optimal 
values and satisfy the given sum-power and interference power 
constraints. However, the infeasible start Newton algorithm 
converges significantly faster 

III. A NOVEL WSRM ALGORITHM FOR ZFBF 

The WSRM problem with ZFBF (j5]l can be reformulated 
in terms of unnormalized transmit matrices (i.e., including the 

'Here Sij denotes Kronecker's delta, equal to 1 if i = j and to otherwise. 
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Fig. 2. Rate and power convergence behavior of the infeasible start Newton 
method for DPC under the same conditions of Fig. [T] 



transmit powers) as 

maximize ^ log (l + h^Tfchfc) 

fc=i 

subject to h^Tfehj =0 V j 7^ fc 

Tfc>0, rank(Tfc) = l, Vfc (14) 



Problem Q is not convex due to the rank-1 constraint. In 
[10] the problem is solved for the equal-weight case and per- 
antenna constraint and it is shown that the convex relaxation 
problem obtained by removing the rank-1 constraints has 
always a rank-1 solution. Following in the footsteps, it is 



easy to show that the same holds for the general case (14i. 
In particular, letting {T^} denote a solution of the relaxed 
problem with possibly rank(TJ|;) > 1 for some k, a rank-1 
solution T/j = t/jtj^ achieving the same optimal value can 
be determined by solving, independently for each user, the 
problem: 



maximize 
subject to 



h^tfc e R+ 

tr(tfct^*,) <ti-(T**,), ye (15) 



We notice that ( 15 1 is a Second-Order Cone Program (SOCP) 



and can be easily solved by standard tools. In the special case 
of per-antenna constraints, treated in [10], ( [T5| ) reduces to a 
linear program. 

Two main issues arise from the convex relaxation approach. 
1) A dramatic dimensionality increase: the relaxed problem 
deals with K symmetric matrices of dimension M x M, 
that is, with KM{M - l)/2 = 0{KAP) variables. 2) Lack 
of an efficient computational method: in [10], the relaxed 



problem for equal weights is cast as a MAXDET for which 
efficient solvers exist. For general weights, the problem is 
not MAXDET and general-purpose convex optimizers must 
be used, with consequent increase of the computation burden. 
In the following we address both issues. 

The zero-forcing constraints h^t^: = for all j k imply 
that the linear precoding matrix T = [ti, . . . , tx] must be a 
right generalized inverse of the channel matrix H'^, i.e., it can 
be expressed in the form 



[gifli, . . . ,gA'afc] + U-^[bi, . . 
G + U^B 



(16) 



where is the normalized (to unit-norm) fc-th column of 
the Moore-Penrose (right) pseudoinverse H(H'^H)^^ of H'^, 
{flfe} are scalar coefficients, U-'- is an orthogonal projector 
onto the orthogonal complement of the span of the channel 
vectors {h/j}, and {b^} are (M — /C) -dimensional vectors of 
coefficients. The direct optimization of the coefficients {a^} 
and {bfc} can be obtained by iterating two steps: 1) for fixed 
steering vectors, optimize the power allocation; 2) for fixed 
relative powers on the pseudo-inverse directions, maximize a 
common scaling factor by optimizing the steering vectors. 

Step 1. Initialize the steering vectors by tfe = g^, cor- 
responding to ak = 1 and b^ — 0, for all k. The ZFBF 
power allocation problem for fixed (not necessarily unit-norm) 
steering vectors is 
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maximize ^ Wk log(l + |h^tfe| 
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subject to: ^ qkt^^itk < 7^, 
fe=i 
q > 



(17) 



Defining the L x K matrix C with {£, k) element [C]^ = 
—t^^etk, the constraint can be written as Cq < 1. The 



Lagrangian for ( 17 1 is 
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£(q, A) = ^ u;fc log(l + Mtkl^qk) - A^Cq + A^l (18) 

where A > is a vector of dual variables. The KKT conditions 
for Qk yield the waterfilling-like solution 

1 



A^cfe 



(19) 



where c^. is the fc-th column of C. Using this into £(q, A), 
we can solve the dual problem by minimizing £(q(A), A) with 
respect to A > 0. It is immediate to check that for any. A' > 0, 

/:(q(A'),A') > C{q{X),\') 

= £(q(A),A) + (l-Cq(A))T(A'-A) 

(20) 

Therefore, s(A) = (1 — Cq(A)) is a subgradient for 
£(q(A),A). It follows that the dual problem can be solved 
by a simple L-dimensional subgradient iteration. 



Step 2. Let {q^} denote the output of Step 1 for fixed 
steering vectors {t^}. It follows that, by construction, ~ 
y/qkg.\ftk- In this step we fix {ofc} as given above and search 
for the steering vectors that maximize a common power scaling 
factor 77. Using ( [T6| we obtain the optimization problem 



maximize 
subject to: 
with solution readily given by 



— tr(TT^*£) < 1 



(21) 



where B is the solution of 



minimize 



subject to: 



-tr(TTH*f) <u \f£ 



(22) 



and where T and B are related by ( 16 1. It is recognized that 
( p2] i is a SOCP with respect to the variables u and B and can 
be solved by standard efficient tools [15]. 

The output of Step 2 is a new set of steering vectors in 
the form — ?7[gfeafc + U-'-bfe]. These can be used as 
new fixed steering vectors for Step 1, and so on. With the 
proposed initialization, at the first round of Step 1 we obtain 
the optimal solution based on the pseudo-inverse steering 
vectors. Then, the algorithm finds a generalized inverse that 
improves upon the pseudo-inverse akeady after one iteration. 
Although It is known (see [10]) that the pseudo-inverse ZFBF 
is optimal under the sum-power constraint, under general 
linear constraints it may be very suboptimal. 

Fig. [3] illustrates the convergence behavior of the proposed 
iterative algorithm for ZFBF under general linear constraints. 
Channel and constraint parameters are the same as in Figs. [T| 
and |2] The proposed algorithm for ZFBF satisfies the given 
sum-power and interference power constraints as in DPC 
cases. The circles indicate the iterations at which the steering 
vectors update (step 2) is performed, i.e., when the power 
optimization of step 1 begins with the new set of steering 
vectors. Since the steering vectors are initialized with the 
pseudo-inverse directions, the performance of pseudo-inverse 
ZFBF is given at the end of the first round of step 1, i.e. 
right on the left of the second circle in the plots (iteration 
number 39 in the plot). We notice that the pseudo-inverse 
ZFBF is markedly suboptimal in this case, in fact, the transmit 
sum power constraint is not met with equality. This means 
that if one insists on pseudo-inverse steering vectors the 
transmitter has to back off its transmit power in order to 
meet the interference constraints. Instead, our algorithm finds a 
generalized inverse that yields a significant improvement and, 
in this case, meets all constraints with equality. 
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